Corpus: zho_news_2021_100K


3.8.2 Number of letter-N-grams at word endings

Number of distinct letter N-grams (N = 2 to 6) found at the ends of the top K words


[Figure: Zipf's diagram for word beginnings (Gnuplot diagram)]

K         # of bigrams  # of trigrams  # of 4-grams  # of 5-grams  # of 6-grams
100                 99             99            99            99            99
1000               992            999           999           999           999
10000             9579           9966          9996          9997          9997
100000           68716          89884         96885         99013         99641
1000000         131936         193224        204462        207402        208263
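
A table of this kind could be computed roughly as follows. This is a minimal sketch, not the code that produced the figures above: the function name `ending_ngram_counts`, the input format (a mapping from word to frequency), the tie-breaking by alphabetical order, and the rule that a word shorter than N contributes itself as a whole are all assumptions made for illustration.

```python
from collections import Counter

def ending_ngram_counts(word_freqs, ks=(100, 1000), ns=(2, 3, 4, 5, 6)):
    """Count distinct word-final letter N-grams among the top K words.

    word_freqs: mapping word -> frequency (hypothetical input format).
    Returns {K: {N: number of distinct endings of length N}}.
    """
    # Rank words by descending frequency; ties are broken alphabetically
    # (an assumption, chosen only to make the result deterministic).
    ranked = [w for w, _ in sorted(word_freqs.items(),
                                   key=lambda kv: (-kv[1], kv[0]))]
    table = {}
    for k in ks:
        top = ranked[:k]
        # w[-n:] is the last n characters; for words shorter than n it is
        # the whole word (assumed handling of short words).
        table[k] = {n: len({w[-n:] for w in top}) for n in ns}
    return table

# Toy example on a tiny English word list, not the zho_news corpus.
freqs = Counter("the cat sat on the mat with the hat".split())
result = ending_ngram_counts(freqs, ks=(3, 7), ns=(2, 3))
```

In the toy example, the top 3 words (the, cat, hat) share only two distinct final bigrams, while all 7 words end in distinct trigrams. The same effect shows in the table: the counts approach K as N grows.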
921 msec needed at 2022-01-24 06:11